Physics-Informed Generative Editing for Realistic Video Synthesis from Natural Language
dev.to · 2h · Discuss: DEV
🧠 Learned Codecs
LLM Rerankers for RAG: A Practical Guide
fin.ai · 19h
🔍 Information Retrieval
WhisTLE: Deeply Supervised, Text-Only Domain Adaptation for Pretrained Speech Recognition Transformers
arxiv.org · 13h
🎙️ Whisper
HugeTracker Part 6: Subpatterns
gbstudiocentral.com · 4h
🎛️ Audio Synthesis
Pre-viva Talk - 02/10/2025
informatics.ed.ac.uk · 4h
🤖 Grammar Induction
Vergilian - The speech coach
dev.to · 1d · Discuss: DEV
🎙️ Whisper
[P] Convolutional Neural Networks for Audio -- the full story behind SunoAI
reddit.com · 1d
🎧 Learned Audio
Mixed Excitation Linear Predictive (MELP) Vocoders
melpe.org · 11h · Discuss: Hacker News
🎧 Learned Audio
Learn How to Use Transformers with HuggingFace and SpaCy
towardsdatascience.com · 3h
🎯 Dependent Parsing
CoDiCodec: Unifying Continuous and Discrete Compressed Representations of Audio
arxiv.org · 13h
🎧 Learned Audio
Building a Hands-Free AI Fitness Applet with Gemini Live API
dev.to · 1d · Discuss: DEV
🎙️ Whisper
Testing chatbots on the creation of encoders for audio conditioned image generation
arxiv.org · 13h
🧠 Learned Codecs
Semantic Dictionary Encoding
falvotech.com · 2h · Discuss: Hacker News
🌀 Brotli Dictionary
Turning Music Into Art - Building a Synesthesia Simulator with Gemini
dev.to · 22h · Discuss: DEV
🎧 Learned Audio
Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.to · 1d · Discuss: DEV
📝 Concrete Syntax
MultimodalHugs: Enabling Sign Language Processing in Hugging Face
arxiv.org · 13h
🏛 Digital humanities
Show HN: I made an app that turns scripts to videos in minutes
kliptory.com · 11h · Discuss: Hacker News
🗜️ LZW Variants
Transform Lectures into Summaries, Questions, and Blog Ideas with Lecture lab AI
dev.to · 13h · Discuss: DEV
🏛 Digital humanities
CurioShorts
dev.to · 13h · Discuss: DEV
🌀 Brotli Internals